智能论文笔记

Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations

Ziyan Yang , Kushal Kafle , Franck Dernoncourt , Vicente Ordóñez Román

分类：计算机视觉 | 自然语言处理 | 机器学习

2022-06-30

我们建议对视觉模型预处理的基于利润的损失，以鼓励基于梯度的解释，这些解释与区域级注释一致。我们将该目标称为注意面罩的一致性（AMC），并证明它与依赖于区域级注释的模型相比，它产生了卓越的视觉接地性能，以显式训练对象检测器，例如更快的R-CNN。 AMC通过鼓励基于梯度的解释掩盖来工作，该掩盖的注意力分数主要集中在包含这种注释的图像的注释区域中。尤其是，在标准视觉建模目标之上接受AMC训练的模型在FlickR30K视觉接地基准中获得了86.59％的最新精度，与最佳先前模型相比，绝对改善了5.48％。我们的方法在既定的基准中都表现出表达理解，并通过设计基于梯度的解释来更好地与人类注释保持一致，从而提供了极大的表现。

translated by 谷歌翻译

Removing Objects From Neural Radiance Fields

Silvan Weder , Guillermo Garcia-Hernando , Aron Monszpart , Marc Pollefeys , Gabriel Brostow , Michael Firman , Sara Vicente

分类：计算机视觉

2022-12-22

Neural Radiance Fields (NeRFs) are emerging as a ubiquitous scene representation that allows for novel view synthesis. Increasingly, NeRFs will be shareable with other people. Before sharing a NeRF, though, it might be desirable to remove personal information or unsightly objects. Such removal is not easily achieved with the current NeRF editing frameworks. We propose a framework to remove objects from a NeRF representation created from an RGB-D sequence. Our NeRF inpainting method leverages recent work in 2D image inpainting and is guided by a user-provided mask. Our algorithm is underpinned by a confidence based view selection procedure. It chooses which of the individual 2D inpainted images to use in the creation of the NeRF, so that the resulting inpainted NeRF is 3D consistent. We show that our method for NeRF editing is effective for synthesizing plausible inpaintings in a multi-view coherent manner. We validate our approach using a new and still-challenging dataset for the task of NeRF inpainting.

translated by 谷歌翻译

Reinforcement Learning in System Identification

Jose Antonio Martin H. , Oscar Fernandez Vicente , Sergio Perez , Anas Belfadil , Cristina Ibanez-Llano , Freddy Jose Perozo Rondon , Jose Javier Valle , Javier Arechalde Pelaz

分类：机器学习 | 人工智能

2022-12-14

System identification, also known as learning forward models, transfer functions, system dynamics, etc., has a long tradition both in science and engineering in different fields. Particularly, it is a recurring theme in Reinforcement Learning research, where forward models approximate the state transition function of a Markov Decision Process by learning a mapping function from current state and action to the next state. This problem is commonly defined as a Supervised Learning problem in a direct way. This common approach faces several difficulties due to the inherent complexities of the dynamics to learn, for example, delayed effects, high non-linearity, non-stationarity, partial observability and, more important, error accumulation when using bootstrapped predictions (predictions based on past predictions), over large time horizons. Here we explore the use of Reinforcement Learning in this problem. We elaborate on why and how this problem fits naturally and sound as a Reinforcement Learning problem, and present some experimental results that demonstrate RL is a promising technique to solve these kind of problems.

translated by 谷歌翻译

Runtime data center temperature prediction using Grammatical Evolution techniques

Marina Zapater , José L. Risco-Martín , Patricia Arroba , José L. Ayala , José M. Moya , Román Hermida

分类：人工智能

2022-11-11

Data Centers are huge power consumers, both because of the energy required for computation and the cooling needed to keep servers below thermal redlining. The most common technique to minimize cooling costs is increasing data room temperature. However, to avoid reliability issues, and to enhance energy efficiency, there is a need to predict the temperature attained by servers under variable cooling setups. Due to the complex thermal dynamics of data rooms, accurate runtime data center temperature prediction has remained as an important challenge. By using Gramatical Evolution techniques, this paper presents a methodology for the generation of temperature models for data centers and the runtime prediction of CPU and inlet temperature under variable cooling setups. As opposed to time costly Computational Fluid Dynamics techniques, our models do not need specific knowledge about the problem, can be used in arbitrary data centers, re-trained if conditions change and have negligible overhead during runtime prediction. Our models have been trained and tested by using traces from real Data Center scenarios. Our results show how we can fully predict the temperature of the servers in a data rooms, with prediction errors below 2 C and 0.5 C in CPU and server inlet temperature respectively.

translated by 谷歌翻译

Portable Multi-Hypothesis Monte Carlo Localization for Mobile Robots

Alberto Garcia , Francisco Martin , Jose Miguel Guerrero , Francisco J. Rodriguez , Vicente Matellan

分类：机器人

2022-09-15

自我定位是一种基本功能，移动机器人导航系统集成到使用地图从一个点转移到另一点。因此，任何提高本地化精度的增强对于执行精致的灵活性任务至关重要。本文描述了一个新的位置，该位置使用Monte Carlo定位（MCL）算法维护几个颗粒人群，始终选择最佳的粒子作为系统的输出。作为新颖性，我们的工作包括一种多尺度匹配匹配算法，以创建新的MCL群体和一个确定最可靠的指标。它还贡献了最新的实现，从错误的估计或未知的初始位置增加了恢复时间。在与NAV2完全集成的模块中评估了所提出的方法，并与当前的最新自适应ACML溶液进行了比较，从而获得了良好的精度和恢复时间。

translated by 谷歌翻译

Multi-Modality Cardiac Image Computing: A Survey

Lei Li , Wangbin Ding , Liqun Huang , Xiahai Zhuang , Vicente Grau

分类：计算机视觉

2022-08-26

多模式心脏成像在心血管疾病患者的治疗中起关键作用。它允许互补的解剖学，形态学和功能信息，提高诊断准确性，并提高心血管干预和临床结果的疗效。多模式心脏图像的完全自动化处理和定量分析可能会对临床研究和基于证据的患者管理产生直接影响。但是，这些需要克服重大挑战，包括模式间未对准和寻找最佳方法来整合来自不同模式的信息。本文旨在对心脏病学，计算方法，验证策略，相关临床工作流程和未来观点的多模式成像进行全面综述。对于计算方法，我们对这三个任务（即注册，融合和分割）有利，通常涉及多模式成像数据，\ textit {结合来自不同模式的信息或跨模态传输信息的信息}。该评论强调，多模式性心脏成像数据具有广泛适用性的诊所，例如跨体瓣植入指南，心肌生存能力评估和导管消融疗法及其患者选择。然而，许多挑战仍未解决，例如缺失模态，成像和非成像数据的组合以及统一的分析和不同方式的表示。定义完善的技术如何适合临床工作流程以及它们引入了多少其他相关信息，这也有工作要做。这些问题可能会继续是一个积极的研究领域，并且将来要回答的问题。

translated by 谷歌翻译

Deep Computational Model for the Inference of Ventricular Activation Properties

Lei Li , Julia Camps , Abhirup Banerjee , Marcel Beetz , Blanca Rodriguez , Vicente Grau

分类：计算机视觉

2022-08-08

患者特异性的心脏计算模型对于使用数字双胞胎的精密医学和silico临床试验的有效实现至关重要。心脏数字双胞胎可以为个别患者提供心脏功能的非侵入性特征，因此对于患者特定的诊断和治疗分层有希望。然而，目前的解剖学和功能性孪生阶段的工作流，指的是模型解剖结构和临床数据的参数的推断，并不足够有效，稳健且准确。在这项工作中，我们提出了一个基于深度学习的特定于患者的计算模型，该模型可以融合解剖学和电生理信息，以推理心室激活特性，即传导速度和根节点。激活特性可以提供对心脏电生理功能的定量评估，以指导介入。我们采用Eikonal模型来生成具有地面真实特性的模拟心电图（ECG），以训练推理模型，在此还考虑了特定的患者信息。为了进行评估，我们在模拟数据上测试模型，并以快速的计算时间获得通常有希望的结果。

translated by 谷歌翻译

e-Genia3 An AgentSpeak extension for empathic agents

Joaquin Taverner , Emilio Vivancos , Vicente Botti

分类：人工智能

2022-08-01

在本文中，我们介绍了e-genia3代理商的扩展，以为移情剂的发展提供支持。新扩展程序修改了代理商的推理过程，以根据分析事件以及代理商的情感状态和个性选择计划。此外，我们的建议允许软件代理通过两个不同的事件评估过程模拟自我和其他代理之间的区别：移情评估过程，以使情绪作为对其他代理情绪的反应以及其他非情感评估过程的反应，并为其他非情感评估过程 - 同情情感事件。移情调节过程适应了基于人际因素（例如，代理人的人格和情感记忆）和代理人的人际特征（例如，代理人之间的情感联系），适应引起的同理心情绪。使用过去事件的记忆及其相应的引起的情绪，可以保持情感联系，以支持代理之间的长期移情互动。

translated by 谷歌翻译

Ethics for social robotics: A critical analysis

Júlia Pareto Boada , Begoña Román Maestre , Carme Torras

分类：机器人

2022-07-25

社会机器人技术的发展和欧洲前景实践的发展，将这些基于AI的系统纳入机构医疗保健环境中，要求（重新）根据人类价值和权利来配置我们的实践生活。尽管人们对社会机器人技术的道德含义越来越关注，但目前对其中心分支之一的辩论社会辅助机器人技术（SAR）仍取决于一种贫穷的道德方法。本文介绍并研究了这种现行方法的一些趋势，这些趋势已被批判文献综述所确定。基于对道德反思如何导致社会机器人技术的代表性案例的分析，概述了一些未来的研究行，这可能有助于重塑和加深其道德意义。

translated by 谷歌翻译

Experimental Evaluation of Visual-Inertial Odometry Systems for Arable Farming

Javier Cremona , Román Comelli , Taihú Pire

分类：机器人

2022-06-10

农业行业不断寻求农业生产中涉及的不同过程的自动化，例如播种，收获和杂草控制。使用移动自主机器人执行这些任务引起了极大的兴趣。耕地面向同时定位和映射（SLAM）系统（移动机器人技术的关键）面临着艰巨的挑战，这是由于视觉上的难度，这是由于高度重复的场景而引起的。近年来，已经开发了几种视觉惯性遗传（VIO）和SLAM系统。事实证明，它们在室内和室外城市环境中具有很高的准确性。但是，在农业领域未正确评估它们。在这项工作中，我们从可耕地上的准确性和处理时间方面评估了最相关的最新VIO系统，以便更好地了解它们在这些环境中的行为。特别是，该评估是在我们的车轮机器人记录的大豆领域记录的传感器数据集中进行的，该田间被公开发行为Rosario数据集。评估表明，环境的高度重复性外观，崎terrain的地形产生的强振动以及由风引起的叶子的运动，暴露了当前最新的VIO和SLAM系统的局限性。我们分析了系统故障并突出观察到的缺点，包括初始化故障，跟踪损失和对IMU饱和的敏感性。最后，我们得出的结论是，即使某些系统（例如Orb-Slam3和S-MSCKF）在其他系统方面表现出良好的结果，但应采取更多改进，以使其在某些申请中的农业领域可靠，例如作物行的土壤耕作和农药喷涂。。

translated by 谷歌翻译